Optimised Learning from Demonstrations for Collaborative Robots
نویسندگان
چکیده
The approach of Learning from Demonstrations (LfD) can support human operators especially those without much programming experience to control a collaborative robot (cobot) in an intuitive and convenient means. Gaussian Mixture Model Regression (GMM GMR) are useful tools for implementing such LfD approach. However, well-performed GMM/GMR require series demonstrations trembling jerky features, which challenging achieve actual environments. To address this issue, paper presents novel optimised improve clusters then further so that enabled cobots carry out variety complex manufacturing tasks effectively. This research has three distinguishing innovative characteristics: 1) noise strategy is designed scatter with features better the optimisation GMM/GMR; 2) Simulated Annealing-Reinforcement (SA-RL) based algorithm developed refine number eliminating potential under-/over-fitting issues on 3) B-spline cut-in integrated GMR adaptability reproduced solutions dynamic tasks. verify approach, cases studies pick-and-place different complexities were conducted. Experimental results comparative analyses showed exhibited good performances terms computational efficiency, solution quality adaptability.
منابع مشابه
Learning from Limited Demonstrations
We propose a Learning from Demonstration (LfD) algorithm which leverages expert data, even if they are very few or inaccurate. We achieve this by using both expert data, as well as reinforcement signals gathered through trial-and-error interactions with the environment. The key idea of our approach, Approximate Policy Iteration with Demonstration (APID), is that expert’s suggestions are used to...
متن کاملRobot Learning from Failed Demonstrations
Robot learning from demonstration (RLfD) seeks to enable lay users to encode desired robot behaviors as autonomous controllers. Current work uses a human’s demonstration of the target task to initialize the robot’s policy, and then improves its performance either through practice (with a known reward function), or additional human interaction. In this article, we focus on the initialization ste...
متن کاملLearning Options for an MDP from Demonstrations
The options framework provides a foundation to use hierarchical actions in reinforcement learning. An agent using options, along with primitive actions, at any point in time can decide to perform a macro-action made out of many primitive actions rather than a primitive action. Such macro-actions can be hand-crafted or learned. There has been previous work on learning them by exploring the envir...
متن کاملReinforcement Learning from Imperfect Demonstrations
Robust real-world learning should benefit from both demonstrations and interaction with the environment. Current approaches to learning from demonstration and reward perform supervised learning on expert demonstration data and use reinforcement learning to further improve performance based on reward from the environment. These tasks have divergent losses which are difficult to jointly optimize;...
متن کاملLearning Skills from Human Demonstrations
Many robots are designed for use in domestic environments where robots will be engaged in household chores. The robots need to learn ways to do the household chores that humans are now doing. We are taking a learning from demonstration (LfD) approach to this problem [1]. In terms of the household chores, a number of tasks are developed so far; for example, bringing a beer bottle from a refriger...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Robotics and Computer-integrated Manufacturing
سال: 2021
ISSN: ['1879-2537', '0736-5845']
DOI: https://doi.org/10.1016/j.rcim.2021.102169